filmov.tv
Understanding Momentum in Stochastic Gradient Descent